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Abstract 

Background: We are developing the Neurological Disease Ontology (ND) to provide a framework to enable 
representation of aspects of neurological diseases that are relevant to their treatment and study. ND is a 
representational tool that addresses the need for unambiguous annotation, storage, and retrieval of data associated 
with the treatment and study of neurological diseases. ND is being developed in compliance with the Open 
Biomedical Ontology Foundry principles and builds upon the paradigm established by the Ontology for General 
Medical Science (OGMS) for the representation of entities in the domain of disease and medical practice. Initial 
applications of ND will include the annotation and analysis of large data sets and patient records for Alzheimer's 
disease, multiple sclerosis, and stroke. 

Description: ND is implemented in OWL 2 and currently has more than 450 terms that refer to and describe 
various aspects of neurological diseases. ND directly imports the development version of OGMS, which uses BFO 2. 
Term development in ND has primarily extended the OGMS terms 'disease', 'diagnosis', 'disease course', and 
'disorder'. We have imported and utilize over 700 classes from related ontology efforts including the Foundational 
Model of Anatomy, Ontology for Biomedical Investigations, and Protein Ontology. ND terms are annotated with 
ontology metadata such as a label (term name), term editors, textual definition, definition source, curation status, 
and alternative terms (synonyms). Many terms have logical definitions in addition to these annotations. Current 
development has focused on the establishment of the upper-level structure of the ND hierarchy, as well as on the 
representation of Alzheimer's disease, multiple sclerosis, and stroke. The ontology is available as a version-controlled 
file at http://code.google.eom/p/neurological-disease-ontology along with a discussion list and an issue tracker. 

Conclusion: ND seeks to provide a formal foundation for the representation of clinical and research data pertaining 
to neurological diseases. ND will enable its users to connect data in a robust way with related data that is 
annotated using other terminologies and ontologies in the biomedical domain. 



Background 

Neurology is concerned with diseases related to the func- 
tioning of the nervous system. These diseases may present 
acutely or chronically, have transient or progressive courses, 
and affect a variety of anatomical regions and cell types. 
They are realized through diverse mechanisms, including 
cell-autonomous disorders, unregulated protein aggregation, 
autoimmune conditions, or vascular pathology [1,2]. There 
are a variety of ways to classify neurological diseases, such as 
by symptomology or pathology. Several classificatory sys- 
tems and terminologies are currently available, such as 
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NIFSTD, ICD-10, SNOMED CT, MeSH, and the Disease 
Ontology [3-7]. Although some of these classificatory sys- 
tems and terminologies are widely used for purposes such 
as billing and medical messaging, they do not satisfy current 
best practices in ontology development and do not provide 
the level of detail needed for precise annotation of and rea- 
soning over data. 

Definitions for terms in medical terminologies are often 
ambiguous or vague, and often the meanings of such 
terms are defined for use only in one particular domain. 
Clinical or research data is thus rarely encoded in a way 
that will allow the linking of various types of data together 
in a coherent fashion that will also support computation. 
Yet, computer-aided reasoning has become increasingly 
important to medical research due, in part, to the vast 
amount of data being generated [8]. This means that it is 
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more important than ever for data to be annotated in a 
clear and unambiguous manner in order to facilitate inte- 
gration across diverse sources and thereby maximize the 
benefit of scientific investigation. The Open Biomedical 
Ontology (OBO) Foundry promotes the development of 
consistent formal ontologies based on a common upper- 
level reference ontology to address this need [9]. The suc- 
cess of the Gene Ontology has shown how a controlled 
and properly curated ontology can benefit and extend re- 
search in medicine [10]. 

The Neurological Disease Ontology (ND) is being de- 
veloped as an extension of the Ontology for General 
Medical Science (OGMS). OGMS represents entities in 
the domain of medicine and disease and addresses the 
need to integrate biomedical data [11,12]. The OGMS 
framework consists of approximately 100 terms that de- 
scribe fundamental aspects of medicine, such as 'dis- 
order', 'diagnosis', 'disease course', clinical encounter' and 
syndrome'. OGMS utilizes a template for generating un- 
ambiguous textual and formal logical definitions for 
terms in an effort to promote the integration of scientific 
data. ND seeks to produce definitions in a similar man- 
ner within the domain of neurological disease. 

We are developing the Neurological Disease Ontology 
to provide a framework for the representation of key as- 
pects of neurological disease. ND is compliant with 
OBO Foundry principles and builds upon the paradigm 
established by OGMS. ND is an ongoing collaborative 
project that aims to establish a formal structure to en- 
able precise representation of a variety of neurological 
diseases and disorders. Our ultimate goal is to accurately 
represent for each disease its molecular, genetic and en- 
vironmental origins, the processes involved in its eti- 
ology and course of progression, as well as its clinical 
presentation and phenotypes, including associated signs, 
symptoms, syndromes, diagnostic criteria, treatment, and 
testing methods. ND has three initial areas of focus: 

1. Alzheimer's disease and diseases resulting in 
dementia 

2. multiple sclerosis and demyelinating diseases 

3. stroke and cerebrovascular events 

Construction and content 

Development of ND 

ND is being developed using both a top-down and 
bottom-up approach to term creation. We use the top- 
down approach to create high-level classes in ND by 
analyzing the types of neurological diseases presented in 
clinical literature and determining how to classify them 
within the ontology. This approach involves determining 
what additional core entities should be part of the ND 
framework in order to allow for a more complete repre- 
sentation of the domain, including relationships between 



upper-level classes. NIF_Dysfunction was used as start- 
ing point for developing the class hierarchy for 'neuro- 
logical disease' in ND [13]. The bottom-up approach 
involves reviewing primary research articles, review arti- 
cles, texts and other sources to inform the development 
of ND. Domain experts and clinical collaborators provide 
constructive feedback and guide decision making on con- 
troversial material. We examine samples of available data 
sources, such as forms for recording clinical history, func- 
tional assessments, and diagnostic charts, to let the data 
inform term creation and refinement. This approach has 
provided the majority of classes and definitions in ND. 

The ontology currently contains approximately 450 
ND classes. Primary work has been done under the 
OGMS classes 'disease', 'diagnosis', 'disease course', and 
'disorder'. Figure 1 shows a subset of classes illustrating 
a portion of the asserted is_a hierarchy between ND, 
OGMS, and the Basic Formal Ontology (BFO) [14]. In 
particular, we have built a large hierarchy of subclasses 
of 'neurological disease' as illustrated in Figure 2. The 
182 subclasses of 'neurological disease' are further subdi- 
vided into additional classifications such as 'neurodegen- 
erative disease', 'demyelinating disease', and 'autoimmune 
neurological disease'. We assert a single inheritance hier- 
archy of disease and capture the complexity of many 
neurological diseases through the use of logical definitions 
that capture additional disease characteristics as onto- 
logical differentia (see below). We have also imported over 
700 classes from external ontologies, such as the Protein 
Ontology (PR), Foundational Model of Anatomy (FMA), 
and Ontology for Biomedical Investigations (OBI) [15-17]. 
ND terms are annotated with ontology metadata such as a 
label (term name), term editors, definition, definition 
source, curation status, and alternative terms (synonyms). 
Currently there are over 300 external references to other 
ontologies and vocabularies. Figure 3 shows a sample of 
the annotations for a term in ND. 

We attempt to provide both textual and logical defini- 
tions for every class. A logical definition provides a set 
of formal axioms specif)^ing the connections between re- 
lated ontology classes, enabling formal reasoning using 
the ontology. These are essential in order to provide 
meaningful results when performing analysis on data an- 
notated using ND. Currently, approximately 50% of the 
classes in ND have logical definitions. Textual defini- 
tions attempt to capture the logical definitions using nat- 
ural language and provide additional clarification when 
needed. Editor notes are used to offer alternative defini- 
tions, indicate gaps in current knowledge, or provide fur- 
ther elucidation beyond that of the textual definition. 
We adhere to the principle of ontological realism in de- 
veloping ND [18]. Wherever possible, terms in ND refer 
to universal types. Some ND terms, such as 'vascular 
disease resulting in dementia', have been created for 
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Figure 1 Graphical overview of high level terms in ND. A subset of classes in ND, showing the is_a relationships between BFO (in black), 
OGMS (in blue) and ND (in purple). 



organizational purposes and are identified as such in 
their annotations. 

ND is being developed using Protege 4.x and imple- 
mented in OWL 2 [19,20]. The development versions of 
OGMS (http://purl.obolibrary.org/obo/ogms/dev/ogms. 
owl) and BFO 2 (http://purl.obolibrary.org/obo/bfo.owl) 
provide the upper-level classes and relations for ND 
[21]. ND imports classes from other ontologies following 
the MIREOT standard using OntoFox [22]. All relation- 
ships discussed are type-type relationships of the form 
"X RELATION some Y", such that for every instance x 
of the type X, x necessarily must stand in relation to 
some instance y of the type Y. 

ND as an extension of OGMS 

In addition to classif)^ing, naming, and defining diseases, 
development of ND requires representation of entities 



related to diseases, such as the disease course and symp- 
tomology. ND is being developed as an extension of 
OGMS, which provides essential high-level terms in- 
cluding 'disease', 'disorder', and 'disease course'. OGMS 
defines 'disease' as a disposition of an organism to 
undergo pathological processes that exists because of one 
or more disorders of that organism (OGMS_0000031). A 
disposition is a realizable entity that inheres in a material 
entity and will manifest particular behaviors given certain 
operative conditions. Thus, diseases are propensities for 
certain pathological processes to occur as a result of a ma- 
terial disorder in an organism. Figure 2 shows the class 
hierarchy for ND 'neurological disease', which is a subclass 
of OGMS 'disease'. 

In general, we create textual definitions for terms 
based on the Aristotelian format according to which 
each class is defined as a member of its parent class plus 
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C 'autoimmune neurological disease' 

► # 'autonomic nervous system disease' 
C 'central nervous system disease' 

▼ Q 'demyelinating disease' 

▼ @ 'autoimmune demyelinating disease' 

>■ # 'acute disseminated encephalomyelitis' 
# 'acute hemorrhagic leukoencephalitis' 
# 'diffuse cerebral sclerosis of Schilder' 
T # 'multiple sclerosis' 

% 'primary progressive multiple sclerosis' 
# 'relapsing multiple sclerosis' 
# 'neuromyelitis optica' 
► ^ 'transverse myelitis' 
9 'central pontine myelinolysis' 
>■ # 'genetic demyelinating disease' 

> % 'infectious demyelinating disease' 
>■ # polyradiculoneuropathy 

# 'subacute combined degeneration' 

► © 'dbease resulting in dementia' 

► # 'epilepsy disease' 

€ 'genetic neurological disease' 

► C 'infectious neurological disease' 
>■ C leukodystrophy 

>■ % 'metabolic nervous system disease' 
# 'neoplastic neurological disease' 

▼ O 'neurodegenerative disease' 

> O 'alpha synucleinopathy' 

▼ Oamyloidopathy 

▼ O'Alzheimers disease' 

► C 'atypical Alzheimers disease' 
^ Q 'genetic Alzheimers disease' 
O 'typical Alzheimers disease' 
@ 'cerebral amyloid angk>pathy' 
>■ ©'Huntingtons disease' 

> © 'motor neuron disease' 

> # 'nervous system paraneoplastic syndrome* 

>■ @ 'neurodegenerative disease resulting in dementia' 
# 'neuronal ceroid lipofuscinosis' 

# 'neuronal intranuclear hyaline inclusion disease' 
^ ©tauopathy 

O 'normal pressure hydrocephalus' 

€ 'paraneoplastic neurological disease' 
>■ € 'peripheral nervous system disease' 
>■ f 'vascular neurological disease' 

Figure 2 Classification of neurological diseases. Screen shot 
from Protege showing neurological disease hierarchy. 



a differentiating criterion. The following definitions of 
core ND terms exemplif)^ this approach: neurological 
disease' is "A disease that has material basis in a neuro- 
logical disorder and, when realized, the resultant patho- 
logical processes affect functioning of the nervous system" 
(ND_0000001); 'demyelinating disease' is "A neurological 
disease that is characterized by loss or dysfunction of 
myelin in the central or peripheral nervous system" 
(ND_0000024); 'neurodegenerative disease' is "A neuro- 
logical disease that is characterized by atrophy or death of 
neurons or related structures progressively affecting the 



functioning of the nervous system" (ND_0000113); 
'neurological disorder' is "A disorder that is the mater- 
ial basis of a neurological disease" (ND_0005501). 

OGMS represents the manifestation of a disease as a 
complex set of interrelated entities using multiple onto- 
logical classes. One area of development in ND has been 
the extension of OGMS 'disease course', which is de- 
fined as "the totality of all processes through which a 
given disease instance is realized" (OGMS_0000063). In 
developing ND, parts of certain disease course types are 
specified and defined. For example, an instance of 'amy- 
loidopathy disease course' is the realization of an in- 
stance of 'amyloidopathy disease'. Thus, the definition of 
ND 'amyloidopathy disease course' (ND_0006031) in- 
cludes the pathological process 'amyloid beta protein 
aggregation'. The formal definition for 'amyloidopathy 
disease course' contains this necessary condition: 

'amyloidopathy disease course' has_occurrent_part 
some 'amyloid beta protein aggregation' 

To connect this class to other classes in the ontology 
additional conditions are asserted: 

'amyloidopathy disease course' realizes some 
'amyloidopathy' 

'amyloidopathy disease course' 

has_participant_at_all_times some ('extended organism' 
and bearer_of_at_all_times' some 'amyloidopathy') 

For example, any process that instantiates 'Alzheimers 
disease course', a subtype of 'amyloidopathy disease 
course', is automatically related to an instance of 'Alzhei- 
mers disease', the organism participating in this process, 
and the material basis. Figure 4 illustrates and elaborates 
on these connections. 

ND distinguishes syndromes from the diseases that pro- 
duce them. OGMS defines 'syndrome' as "A pattern of signs 
and symptoms that typically co-occur" (OGMS_0000086). 
Syndromes, as we interpret them according to a commonly 
held view, share observable clinical presentations - features 
that are often recognizable as being related and predictive. 
Yet syndromes do not always have common underlying 
causes and more than one disease type may result in in- 
stances of the same syndrome. For example, we classify de- 
mentia as a syndrome and not a disease. ND 'dementia is a 
subtype of ND 'mild cognitive impairment' and is defined as 
"The co-occurrence of cognitive deficits in at least two do- 
mains [memory, visual spatial, attention, orientation, execu- 
tive function, functional] resulting in a decline of previous 
functioning" (ND_0003000). Dementia can result from nu- 
merous diseases including Alzheimer's, Parkinson's, and 
Lewy body disease. 
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Annotations 

label 

leukodystrophy 

'alternative term' 
dys myelinating disease 

definition 

A neurological disease which is realized by some dysmyelinating disease course. 

'definition source' 

http://en.wikiDedia.org/wiki/Leukodystrophy 

'definition source' 

http://ulf.orq/what-is-leukodystrophy 
'editor note' 

Discussion is required to determine the status of the leukodystrophies as asserted types of neurological diseases or 
whether they are best left as inferred types. 

'editor note' ^ 

The distinguishing features of the leukodystrohies, aka dysmyelinating diseases, that separate them from 
demyelinating disease are that (1) the pathological processes invloved prevent the normal formation and maintence of 
myelin and (2) the eitology of the disease is hypothesized to be exclusively genetic. 
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Mark Jensen 





Figure 3 Annotations of ND terms. Screen shot from Protege showing annotations for the ND class 'leukodystrophy'. 



A syndrome is a BFO 'quality' and as such must be 
borne by a material entity, e.g., an organism. The organ- 
ism is also the bearer of the disease that gives rise to the 
pathological processes that produce signs and symp- 
toms, some of which constitute the syndrome. A com- 
plex set of axioms is required to relate a disease to a 
syndrome that results from the disease. ND introduces 
and defines 'results in' as a shortcut relation [23,24] to 



capture the disease-to-syndrome link. ND results in' has 
domain 'disease' and range syndrome' and is defined as: 

'disease' D resultsjn 'syndrome' S = def D 
is_realized_by 'disease course' C and C 
has_specific_dependent_at_some_time S and C 
has_participant 'organism' O and O is_bearer_of 
(D and S) 



OGMS: extended 

organism 
^ 



OGMS:disease course 







ND:amylc 
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OGMS: disease 
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> * ' 
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Figure 4 Ontological representation of amyloidpathiy and Alzhieimer's disease. Solid black arrows indicate is_a relationship 
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Support for inferential reasoning in ND 

The use of consistent logical definitions allows add- 
itional hierarchies and relationships between terms to be 
inferred via automated reasoning. In developing ND, we 
chose to construct the asserted disease hierarchy accord- 
ing to the primary mechanism for each disease. This re- 
flects our understanding of diseases and their various 
axes of differentiation based on current methods in 
medicine, such as advanced imaging techniques and 
genetic research. Alternatively, the asserted hierarchy 
could have been developed based on criteria such as the 
source of the disease (e.g. genetic vs. acquired diseases), 
the affected physiological region(s) (e.g. according to 
brain regions or central vs. peripheral parts of the ner- 
vous system), or the resulting symptomology. Many dis- 
eases also have idiopathic or secondary mechanisms, 
such as infections or environmental exposures that may 
precede certain cases of multiple sclerosis [25]. Since we 
do not allow for multiple inheritance in the construction 
of the ontology, we encode secondary differentia via 
carefully defined axioms, so that with the assistance of 
logical reasoners, we may query for information about, 
for example, diseases that have a material basis in the 
spinal cord or diseases that have infections associated 
with their etiology. 

ND 'Alzheimers disease' is "An amyloidopathy where 
cognitive deficits [memory, visual spatial, attention, orien- 
tation, executive function, functional] occur that represent 
a decline from previous levels of functioning resulting in 
dementia caused by neurodegeneration as a result of de- 
position of amyloid plaques and neurofibrillary tangles in 



A 



the medial temporal lobes" (ND_0000152). Alzheimers 
disease' is thus considered to be a type of amyloidopathy . 
This reflects the most current theories about the under- 
lying disorder for Alzheimer s disease (AD). AD is a mem- 
ber of a family of diseases that result from the unregulated 
accumulation of amyloid protein. However, a significant 
part of the material basis of AD is also neurofibrillary tan- 
gles; also called tau tangles [26,27]. It shares this in 
common with other tauopathies. To obtain the in- 
ferred relationship of Alzheimers disease' also being a 
kind of 'tauopathyl the logical specification for Alzheimers 
disease' has the necessary condition: 

Alzheimers disease' (has_material_basis_at_all_times 
some amyloid beta plaque') and 
(has_material_basis_at_all_times some neurofibrillary 
tangle') 

All tauopathies have material basis in some neurofib- 
rillary tangles and when a reasoner is run on the ontol- 
ogy, ND Alzheimers disease', along with any disease 
class that has an asserted material basis of neurofibrillary 
tangles, will be inferred to be a 'tauopathy' in addition to 
its asserted parent. Figure 5 shows the asserted vs. in- 
ferred hierarchy for ND Alzheimers disease'. Other ex- 
amples of alternate ways of classifying diseases through 
inference are ND central nervous system disease' or ND 
'disease resulting in dementia'. No disease types are 
asserted to be children of these classes; rather, these classes 
are used for reasoning purposes. They are organizational 
terms that provide another way of classifying diseases and 
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'Alzheimers disease' 




Figure 5 Asserted vs. inferred hierarchy for Alzheimer's disease. (A) asserted hierarchy for 'Alzheimers disease' and (B) inferred hierarchy for 
'Alzheimers disease' where the is_Q links to 'tauopathy' and 'disease resulting in dementia' are inferred. 
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related entities. As discussed below, adding terms such as 
these to the ontology will aid in developing queries and 
allow novel questions to be asked using the ontology for 
our use cases. 

Utility and discussion 

ND is developed as a collaborative project, both for clin- 
ical applicability and for coordination and unification of 
a variety of biomedical ontologies for research purposes. 
Many disease ontologies exist that offer unique repre- 
sentations of a single disease or family of diseases. This 
has produced overlapping representations. For example, 
some neurological diseases are also infectious diseases. 
By adhering to the OGMS framework, ND is consistent 
with and is developed collaboratively with other ontol- 
ogies using OGMS. These include the Infectious Disease 
Ontology (IDO), Mental Functioning (MF) and Mental 
Disease (MD) Ontologies, Adverse Event Reporting 
Ontology (AERO), and Ontology of Medically Related 
Social Entities (OMRSE) [28-31]. For example, we have 
proposed that our relation 'results in' - used to con- 
nect a disease to a syndrome - be added to OGMS so 
that it is readily available for use by other OGMS- 
compliant disease ontology efforts. 

In addition to consistently classifying neurological dis- 
eases according to their primary disease mechanism and 
then identifying the relevant associated entities following 
the OGMS model, we have sought to define the relation- 
ships that hold between these entities using formal lo- 
gical definitions for each class in ND. This will facilitate 
integration of data resources and provide accurate anno- 
tation and advanced querying capabilities across data 
from a variety of domains within the general field of 
neurology. This includes, for example, relating genomic 
data to patient reported outcomes and cognitive testing 
in the treatment and study of multiple sclerosis or Alz- 
heimer's disease, as well as providing the proper formal 
framework to allow for logical reasoning. Reasoning with 
datasets through the use of logical relations is a capabil- 
ity that other ontologies in the domain of neurological 
disease have not fully explored. Other efforts exist that 
have used ontologies to annotate and query over data- 
sets, such as the Sleep Domain Ontology and Epilepsy 
and Seizure Ontology [32,33]. However, these ontologies 
focus on entities that are specific to their respective do- 
mains rather than on neurological diseases in general. 
Although these ontologies have proven valuable for their 
specific use cases, they differ from our goal to develop a 
domain-encompassing ontology for neurological diseases 
that utilizes relations between the disease classes and re- 
lated classes to maximize the potential reasoning power. 
ND is being built to provide both an accurate description 
of neurological diseases as well as to serve clinical and re- 
search purposes through annotation and reasoning. A key 



developmental goal is to create a more thorough repre- 
sentation of neurological diseases than can be expressed 
using only an is_a hierarchy. This is one way in which 
we build upon prior efforts in this area, such as NIF_- 
Dysfunction [3]. 

A key decision during the initial phase of development 
regarded the extent to which the NIF_Dysfucntion hier- 
archy should be maintained or altered in ND. While we 
believe it is important to utilize existing work as much 
as possible, we chose to not directly import NIF_Dys- 
function for several reasons. Generally we found the 
NIF_Dysfunction hierarchy in need of further curation. 
More specifically, there were several inconsistencies in the 
hierarchy that made direct use problematic - especially 
given our commitment to develop a comprehensive 
OGMS-based picture of neurological disease. Distinctions 
between upper-level classes in NIF_Dysfunction are made 
based on disease mechanism, while lower-level distinc- 
tions are made based on symptomology [34]. Ambiguous 
uses of the terms 'disease', syndrome', and 'disorder' exist 
throughout NIF_Dysfunction. For example, the term 
'Cerebrovascular Disorder' is a subclass of 'Nervous sys- 
tem disease! but the children of 'Cerebrovascular Dis- 
order; such as 'Brain Ischemia' and 'Cerebral Hemorrhage', 
are not diseases. There is also a term for 'Nervous System 
Trauma' under 'disease'. A clear distinction needs to be 
made between physical abnormalities and injuries versus 
diseases. Also, many disease types in NIF_Dysfunction are 
better understood as symptoms or signs, like 'paralysis', 
'paresis', and all of the subtypes under 'dyskinesia'. We have 
held preliminary discussion with NIF_Dysfunction devel- 
opers to determine whether they would alter their hier- 
archy to match ours or develop bridging axioms to 
connect the two ontologies. The goal is not to replace, but 
rather to connect to and integrate various disease ontol- 
ogies such as NIF_Dysfunction and the Disease Ontology 
(DO), as well as related ontologies such as the Phenotypic 
Quality Ontology (PATO) [35,36]. These ontologies all use 
BFO for their upper level terms and adhere to OBO Foun- 
dry principles, yet offer differing classifications of diseases 
and related entities. 

In addition to providing the framework for clear anno- 
tation and reasoning over data, the approach for repre- 
senting disease we adapt from OGMS resolves a great 
deal of ambiguity. For example, terms in different vo- 
cabularies often describe unique disease types for en- 
tities that are better understood as a stage or part of a 
disease course. A common error made by multiple scler- 
osis (MS) terminologies is to assert the existence of 
unique disease types for the clinical variants of MS: re- 
lapsing remitting multiple sclerosis (RRMS), secondary 
progressive multiple sclerosis (SPMS), and primary pro- 
gressive multiple sclerosis (PPMS). Research into this 
domain strongly suggests that these variants are distinct 
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phases in the realization of MS and are not themselves 
unique types of disease [37,38]. These variants are dis- 
tinguished by their unique patterns of episodes of wors- 
ening neurological functioning both in terms of duration 
and intensity, which can be charted over time. Disability 
due to MS is measured according to the Kurtze Ex- 
panded Disability Status Scale (EDSS), which produces 
an identifiable and unique pattern of neurological dis- 
ability. For example, RRMS is a part of the relapsing 
multiple sclerosis disease course that is specified by a 
particular relapsing-remitting pattern of neurological 
disability, which can be specified in a formal manner. 
When a clinical variant of MS is referred to in a dataset, 
say, as an instance of a diagnosis of RRMS, we annotate 
it in ND by generating an instance of 'diagnosis of re- 
lapsing remitting multiple sclerosis', which is about an 
organism that bears the disease relapsing multiple scler- 
osis and participates in a relapsing multiple sclerosis dis- 
ease course. Figure 6 illustrates this by showing the 
relationships that exist between instances of classes in 
the ontology whenever an instance of 'diagnosis of re- 
lapsing remitting multiple sclerosis' is created. 

Ongoing applications of ND 

ND is being built with an eye toward the annotation of 
data in the pursuit and support of novel research. In 
support of one use case, we recently received a grant 
from the National Multiple Sclerosis Society to investi- 
gate the relationship between patient-reported outcomes 
and clinical findings for MS as reported in the New York 
State Multiple Sclerosis Consortium patient registry, 
which has been collecting MS patient data for nearly 
two decades [39]. We will be adding additional terms to 
ND to support this application and then use terms from 



ND to annotate de-identified patient records. Using our 
corpus of annotated records, we will be able to query for 
specific subsets of patient records based on criteria such 
as the type of MS disease course or type of treatment. 
We will then look for unexpected co-annotated terms, 
particularly with respect to treatments and clinical out- 
comes. For instance, we will look for unexpected out- 
comes in certain subsets of patients given particular 
treatments or syndromes associated with disease courses 
that share certain pathological processes. 

We are also collaborating with our clinical partners at 
the University at Buffalo to develop terms in ND in sup- 
port of patient registries for Alzheimer's disease and 
stroke victims. This work has led to the development of 
a corollary project to ND, the Neuropsychological Test- 
ing Ontology (NPT) [40]. NPT is an extension of the 
Ontology for Biomedical Investigations (OBI) [17] that 
represents the kinds of neuropsychological tests used in 
assessing cognitive functioning in patients with a variety 
of diseases, injuries and conditions. One area of future 
development in ND is within the diagnosis branch of the 
ontology. Since neuropsychological tests are used in 
diagnostic processes in neurology, there is need to link 
the two ontologies, ND and NPT, in order to connect 
clinical findings, such as testing results, to diagnostic cri- 
teria for neurological diseases. 

Conclusions 

The Neurological Disease Ontology is being generated 
and curated according to best practices in ontology de- 
velopment. We are adhering to fundamental principles 
of reuse and collaboration to maximize data integration. 
We are providing a set of reference terms applicable to 
any neurological disease as well as developing specific 
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extensions for AD and MS. These extensions provide 
templates for others interested in building additional ex- 
tensions of ND. ND is an important first step toward 
achieving the goal of linking the material basis of neuro- 
logical diseases to their symptomatology as manifested 
in particular patients using a consistent, controlled, and 
unambiguous reference ontology. ND provides an excel- 
lent foundation to represent clinical and research data, 
and formally connects it with related data from other do- 
main terminologies and ontologies across the spectrum of 
biomedical informatics. 

ND is under active development. As we continue to 
work on the ontology, we expect to make many addi- 
tions as well as needed revisions in order to better repre- 
sent the domain. Further work in ND will involve 
continued development of the representation of mater- 
ial bases, treatments, phenotypes, and associated syn- 
dromes of neurological diseases as well as exploration 
of the links between diagnoses, diagnostic guidelines, 
and clinical findings. We welcome collaborators inter- 
ested in developing disease-specific extensions or in 
using ND for annotation and reasoning purposes. 

Availability and requirements 

ND is freely available under the New BSD License 
(Code) and the Creative Commons 3.0 BY License (Con- 
tent) at the following URL: 

http://neurological-disease-ontology.googlecode.com/ 
svn/trunk/src/ontology/ND.owl 
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